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Abstract — The physical-layer network coding (PNC) approach 
provides improved performance in many scenarios over "tra- 
ditional" relaying techniques or network coding. This work 
addresses the generalization of PNC to wireless scenarios where 
network nodes have multiple antennas. We use a recent ma- 
trix decomposition, which allows, by linear pre- and post- 
processing, to simultaneously transform both channel matrices 
to triangular forms, where the diagonal entries, corresponding 
to both channels, are equal. This decomposition, in conjunction 
with precoding, allows to convert any two-input multiple-access 
channel (MAC) into parallel MACs, over which single-antenna 
PNC may be used. The technique is demonstrated using the 
two-way relay channel with multiple antennas. For this case it 
is shown that, in the high signal-to-noise regime, the scheme 
approaches the cut-set bound, thus establishing the asymptotic 
network capacity. 

Index Terms — network modulation, physical-layer network 
coding, network capacity, multiple access channel, structured 
codes, nested lattices, MIMO channels, two-way relay channel 



I. Introduction 

The capacity region of networks is a long-standing problem 
in Information Theory. Wireless networks are of special prac- 
tical interest. Traditionally, such networks were treated at two 
different levels: A physical-layer local code which translates 
the wireless channels into "bit-pipes", and a network code over 
the bit network. This separation is in general sub-optimal, as 
was demonstrated in recent years by physical-layer network 
coding (PNC) approaches. 

In the most basic variant of PNC (for single-antenna 
nodes), relay nodes simply forward their inputs using power 
adjustment only ("amplify-and-forward"). This analog-PNC 
approach helps in opening network bottlenecks, by allowing 
a node to assist in transmission even if it cannot decode the 
message, and indeed it is optimal in some cases, e.g., some 
limits of the parallel relay network JT], @. However, analog- 
PNC suffers from noise accumulation: Without decoding, the 
relays also forward noise. Structured-PNC is an alternative 
approach which solves both the bottleneck and noise accumu- 
lation effects at the same time. It uses structured/linear codes, 
building on the property that an integer linear combination of 
codewords is a codeword as well. A relay node may be able 
to decode the combination codeword, even if it cannot decode 
the individual messages. It was first presented by Wilson et 
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al. [3) for the simple two-way relay channel. Nam et al. J4] 
have shown that such a scheme achieves a rate, within half a 
bit of the cut-set bound, for any channel coefficients. A more 
general network is treated by Nazer and Gastpar's "compute- 
and-forward" strategy ||5l . 

Wireless communication widely uses multiple-input- 
multiple-output (MIMO) techniques to obtain degrees 
of freedom. It is therefore natural to ask, what can the 
combination of PNC and MIMO achieve over a network 
where terminals have multiple antennas. While some works 
considered MIMO versions of analog-PNC (see, e.g., J6)- 
|8 |), little was done in the context of structured-PNC. In Q, 
Zhan et al. demonstrate that in the two-way relay channel, 
structured-PNC can gain from multiple relay antennas - but 
the work is restricted to single-antenna terminal nodes, not 
allowing to fully enjoy MIMO gains. 

In this work we introduce a way to combine structured- 
PNC and MIMO transmission, where a relay node receives 
the linear combination of two transmit nodes via MIMO 
channels. The approach we propose builds on a recently 
introduced matrix decomposition [10], that allows to simulta- 
neously transform two channel matrices into triangular form 
with equal diagonals, by applying unitary operations at both 
sides. The transformation is applied to the channel matrices 
in conjunction with dirty-paper precoding at the transmitters, 
which yields parallel channels. 

An essential property of the matrix decomposition, which 
allows to independently apply structured-PNC to each sub- 
channel without loss, is that the ratio between the gains of 
both transmitters is fixed for all the parallel channels. We note 
that the two-way MIMO relay channel problem was recently 
also considered in ifTTl . where it was proposed to use the 
the generalized singular value decomposition (GSVD) Ifl2l 
to transform the matrices into triangular form. However the 
GSVD fails to achieve a fixed ratio. Thus, in general it yields 
sub-optimal results. 

The rest of the paper is organized as follows. In Section [TT] 
we present the problem and the main result. In Section [HI] 
we provide a constructive proof using the aforementioned 
decomposition. We conclude in Section [IV] by presenting 
performance comparisons and discussing some extensions. 

II. The Two-Way MIMO Relay Channel 

The two-way relay channel consists of two terminals and a 
relay. We define the channel model as follows. Transmission 



takes place in two phases, each one, w.l.o.g., lasting N channel 
uses. At each time instance in the first stage, terminal i 
(i = 1,2) transmits a signal Xi >n and the relay receives 
Y n according to some memoryless multiple-access channel 
(MAC) Wmac(Y\Xi, X2). At each time instance in the second 
phase, the relay transmits a signal X n and the terminals 
receive Y^ n according to some memoryless broadcast (BC) 
channel Wbc(^1i ^I^O- Before transmission begins, terminal 
i (i = 1,2) possesses an independent message of rate Ri, 
unknown to the other nodes; at the end of the two transmission 
phases, each terminal should be able to decode, with arbitrarily 
low error probability, the message of the other terminal. The 
closure of all achievable pairs (Ri,R 2 ) is the capacity region 
of the network. 

We consider a Gaussian MIMO setting, where terminal i 
(i = 1,2) has N t -i transmit antennas and the relay has N r 
receive antennas, during the MAC phase, which is depicted in 
Figure [Ta] Denoting by bold letters the transmit and receive 
vectors, we have the MAC channel: 

Y = Mi + H 2 X 2 + Z , 

where Hi are N r x Nt-i matrices, Z is circularly-symmetric 
white Gaussian noise with unit variance and the inputs are 
subject to the same total power constraint: 
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i = 1,2. 



We assume that the number of transmit antennas at each node 
N t -i is at least as large as the number of receive antennas N r , 
and that the matrices Hi and H 2 are full-rank, i.e., have rank 
AV0 We further assume that the messages have the same rate 
Ri = R 2 = R and that the products of the non-zero singular 
values, of each of the channel matrices, are equal to 1, or 
equivalently: 
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(1) 



The exact nature of the BC channel is not material in the 
context of this work. We characterize it using its common 
message capacity C common . 

First consider the single-antenna case, N r = Np t i = N t . 2 = 
1. It is shown in that one can achieve a rate of: 
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(2) 



By the min-cut theorem, one cannot achieve a rate greater than 
the point-to-point capacity of the MAC links or the common- 
message capacity of the BC channel: 



Res = min {log (1 + P) , C common } 



(3) 



It follows that PNC is optimal in the high-SNR limit. It is 
interesting to compare these to common relaying approaches. 
Using decode-and-forward (D&F) relaying, the relay must 

'For the cases in which the matrices are not full-rank or have more receive 
antennas see Section |TVl 

2 Wilson et al. (5) considered a real-valued network, thus an additional 1/2 
pre-log factor was present in their original expression. 




Terminal 1 Terminal 2 

(a) MIMO MAC phase. 





Relay 



Terminal 1 Terminal 2 

(b) BC phase for the special case where it is a MIMO BC channel. 

Fig. 1: MIMO two-way relay channel. The second (BC) phase 
may be different in general. 

decode both messages with sum-rate 2R. Instead of forwarding 
both message it can use a network-coding approach and XOR 
them, then each terminal can XOR out its own message to 
obtain the desired one. The resulting rate is given by: 



RdF = min <j - log (1 + 2P) , Ccommon 



(4) 



In the high-SNR limit, when the MAC stage is the bottleneck, 
the D&F rate is about half the cut-set bound. In this scenario, 
an approach that we denote "amplify-and-forward" (A&F) 
can be used, where the relay forwards the noisy sum of the 
terminal transmissions^ This approach achieves: 



R AF = log(l + aP) . 



(5) 



where a = a(P, C com mon) is a noise amplification factor which 
is strictly smaller than one for channels of finite capacity. It 
follows that the A&F rate has a finite difference from the cut- 
set bound in the high-SNR limit. 

Moving back to the MIMO case, the cut-set bound is given 
by: 



Res = min {Ci,C 2 ,C common } , (6) 



where 



d = max logdet(7 + H t C x H}), f = 1,2. 

trace(C x )<P 1 

In the high-SNR limit, under the normalization ([T), we have: 



lim d-NAog^- =0. 



(7) 



The next theorem, which is proved in Section [Hi] shows that 
the PNC rate (|2]i generalizes to the MIMO case as follows. 

3 Note that it does not necessarily involve analog amplification, if the BC 
links are not AWGN channels. Instead in that case we compress the sum of 
the codewords, and treat the unbiased quantization noise as channel noise. 



Theorem 1: The capacity of the Gaussian MIMO two- 
way relay network with full-rank channel matrices, with 



iV t , -/Vj 2 "* > N r satisfying (|T), is lower-bounded by 



A^log-^Cc, 



(8) 



On account of (0, the rate i?PNC approaches the cut-set 
bound for high SNR as in the single-antenna case. High-SNR 
conditions hold when 



N r 



> 1 



i = 1,2 

j = l,...,N r 



(9) 



where {Aj ; j} are the singular values of Hi. 

Theorem assumes the use of white input. At any finite 
SNR, it can be improved by optimizing over the input covari- 
ance matrices, however, this rate is already better than the rate 
achievable by the D&F approach, except for very low SNR. 

III. Structured MIMO-PNC Scheme 

In this section we provide a constructive proof for the main 
result, stated in Theorem Q] The key ingredient is using the 
matrix decomposition of iflOl Sec. IV] to obtain equivalent 
parallel single-antenna networks, over which we can use the 
structured-PNC strategy of |3]. The decomposition is given 
in the following theorem of ifTOl . transposed in order to 
accommodate for the MAC setting. 

Theorem 2: Let A\ and A2 be complex-valued full-rank 
matrices, of dimensions m x n\ and m x ri2, respectively, 
such that n.1,71.2 > m (meaning Ai are of rank m). If the 
products of their singular values are equal, then A\ and A2 
can be jointly decomposed as 



Ai = UT x vl 
A 2 = UT2V2 , 



(10) 



where V\, V2 and U are unitary matrices of dimensions 
m x ni, n-2 x ri2 and m x m, respectively; and T\ and T2 
are generalized lower-triangular matrices (matrices with zero 
entries above the main diagonal, i.e., Ti-ij = and T^-ij = 
for i < j, where T^kj denotes the (k,j) entry of Ti) with 
positive equal diagonal elements. 

We apply the decomposition of Theorem [2] to the channel 
matrices Hi and H2- and denote the diagonal entries of 
T\ (which are equal to those of T2 by assumption (HJ) by 

tl, ■ ■ ■ , tiy r . 

We use dirty-paper precoding at each of the terminals to 
cancel the off-diagonal elements of Xi and T2- This results 
in N r parallel channels, with gains given by the (equal for 
both terminals) diagonal elements. Denote these elements of 
Ti (which are also equal to those of T2) by t\,... ,tN r . If 
we use each such channel as part of an independent single- 
antenna two-way relay network with input power P/N r , we 
can achieve a total rate of: 
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(11) 



where is the rate conveyed over sub-channel k, and (a) 
holds true due to (fJJ. 

We now describe in detail a scheme which allows to achieve 
this rate. 

Codebooks generation: Apply the decomposition of Theo- 
rem 12 (where we set ?i; = and m = N r ) to the channel 
matrices Hi and H2 to obtain matrices U, V\, V2, Ti and T2. 
At each terminal, divide the message into N r sub-messages of 
rate r k = R mc / -/V r +log^0Now for each k (k = 1, . . . , N r ), 
both terminals use the same nested-lattice code [13] with 
nesting ratio r k - Denote the coarse lattice (common for all 
sub-messages) by A, and the fine lattices by {Afc}. The second 
moment of A is P/N r , A is assumed to be Rogers-good, while 
{Afc} are good for AWGN coding. 

MAC encoding: Terminal i chooses a lattice point for 
the fc-th submessage|| then sequentially computes: 



A 



i,k 



-i.k 



k-1 



i;k,jXij 



mod A . 



(12) 



Finally, the transmit signal is 

X,; = Vi"X-i . 

MAC decoding: The relay computes Y = W~Y and then 
uses lattice decoding to find £ k , the closest point of A^ 
(modulo A), and recovers messages {nik} according to the 
codebook mapping. 

BC stage: The relay conveys the messages {nik} to both 
terminals, using any capacity-approaching common-message 
BC scheme. 

Final decoding: The terminals recover {£k}, then compute 
for each submessage: 



where 



a f 2 i = l 
1 i = 2 



.1 mod A . 



(13) 



This estimated codeword is mapped back to a message. 

Proof of Theorem\J} Since A is Rogers good with second 
moment P/N r and Vj are orthogonal, the transmission satisfies 
the power constraint. Now, 



Y = U\HiVi±i + H2V2X2 
= T1X1+T2X2 + Z, 



Z) 



4 If this quantity is negative for some k, set = 0; since this only improves 
the achievable rate, we can assume for the sake of analysis that it is always 
positive. 

s ii k and the signals that follow are n-dimensional vectors; we still reserve 
boldface to denote spatial (antenna) dimension 1 , . . . , N r . 




Fig. 2: Rates as a function of the MAC SNR P in Example Q] 
where the MAC section is the bottleneck. Dotted black line - 
i?cs (cut-set bound); red continuous line - -Rpnc; dashed 
magenta line - Ruf\ dashed-dotted blue line - time-sharing 
between PNC and D&F. 



where Z = U. By the precoder operation (fTZt we have: 



Yk 
tk 



mod A 



4 + -f- 



mod A . 



where £k = \ii,k + ^2,fe] mod A, and Z has the same dis- 
tribution as Z. Since if. is a valid codeword of Afc/A, this 
codeword effectively passed through a mod-A channel with 
SNR P fc = t 2 k P/N r . Moreover, since A fe is good for AWGN 
coding at rate logPfc, we have that Ik = Ik with arbitrarily 
low error probability (as the dimension n —¥ oo). Thus, the 
relay may decode the messages {mfc}, and the total rate of 
the latter is 

P 



R 



AUoe 



N r 



1,2. 



Of course, if C common is lower than that, we can achieve 
the lower rate. Now, these messages can be conveyed to 
the terminals with arbitrarily low error probability using rate 
Common- If there was no error in any of the stages, then 
( Tol l yields Tv Lk = T- i k ; by our choice of codebooks, this 
corresponds to a transmission rate -Rpnc- ■ 

Example 1: Figure |2] depicts the achievable rate obtained 
when the channel matrices are: 



Hi 



1/2 




H> 




1/2 



and the MAC phase is assumed to be the bottleneck, compared 
to the cut-set bound and to the D&F rate. As expected, the 
proposed scheme approach the upper bound as the MAC SNR 
increases. 

At this point, one can understand why -Rpnc of Theorem Q] 
is higher than that of Yang et al. HTT1 . The GSVD-based 
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scheme suggested there induces equivalent triangular channels 
with different diagonals, thus suffering substantial loss when 
applying the scalar PNC scheme. In fact, as the GSVD 
provides the "most spread diagonal ratios" (see ifiOl ). a scheme 
that does not apply any precoding matrices Ui should have 
better performance than the GSVD one. 

IV. Extensions 

Time-sharing. As is true also for the SISO case, D&F is 
better than PNC at low rates; for example, it yields a positive 
rate for any positive Pk, while PNC fails for Pk < 1 due to 
the loss of the "1" (see expressions (|2]i and (fTTT i). As noted for 
the single-antenna case in yj, one may use time-sharing in the 
intermediate region, to improve over both. Figure |2]compares 
the different bounds, as a function of the MAC SNR which is 
assumed to be the bottleneck in this case. 

Non full-rank matrices. In the case when the channel 
matrices Hi and H 2 are not full-rank or when there are more 
receive antennas than transmit ones, other strategies needed to 
accompany the approach proposed in this work. Consider, for 
instance the following two channel matrices: 



Hi = 



In this case the relay sees two parallel channels, from each of 
the two terminals. Note that in this case a good strategy would 
be to sum the two channel outputs at the relay, to decode the 
sum-codeword and send it to the two terminals during the 
BC phase, or alternately to decode each of the messages, sum 
them (modulo lattice) and send the result over the BC channel 
(it is more efficient to send a common message than two 
private messages). For more general non full rank matrices, 
such strategies can be combined with the strategy suggested 
in Section [ill] as well applying the joint triangularization of 
Theorem [2] to augmtend versions of the channel matrices as 
is done in IfiOl . 

Use of non zero-forcing elements. We have used the lattice 
codes in a "naive" manner, achieving Rk = log Pp.. That is, 
we lost the "entire" 1. Indeed, using dithering and MMSE 
estimation as in [3| one can achieve the higher rate RpNC,k 
(|2). This will improve performance at low signal to noise ratios 
(though not extremely low, as RpNC,k is only positive for 
Pk > 1/2). Furthermore, instead of the "zero-forcing" decom- 
position proposed in the current work, an MMSE version could 
be used, which decomposes augmented versions of the channel 
matrices along with using non-white covariance matrix, as is 
done in the common message Gaussian MIMO BC case, see 

m. 

Non-symmetric setting. The approach can be extended 
beyond the symmetric setting, to find achievable rate pairs 
(RifRz). Suppose that the ratio between the singular- values- 
product of Hi and H 2 is p, then we can work at each of 
the parallel networks with ratio p x / Nr . If this quantity is not 
integer, then there is a tradeoff between rounding and noise 



amplification Q. However the loss can be bounded as in |4j. 
Furthermore, we may shape the ratios between the diagonals, 
such that all the ratios are integers, except for maybe one, 
or alternately, if the decomposed matrices satisfy a certain 
majorization condtion, it is possible to take all the ratios to be 
equal to 1 except for one, see ifTol for details. 

Application: Colored two-way relay channel. In the 

special case where the channel matrices are diagonal, the 
problem is equivalent to a single-antenna colored channel 
with piecewise-constant spectrum; by increasing the matrix 
dimension, arbitrary spectra can be accommodated for. Inter- 
estingly, the precoding operation at the terminals (TTZt is in the 
frequency domain in this case, unlike traditional time-domain 
Tomlinson precoding. 

Coding for the broadcast section. The only assumption 
we used regarding the BC section, is that it has a common- 
message capacity high enough such that it does not limit 
performance. However, it seems likely that the links to the 
terminal will be wireless MIMO ones as well. In that case, 
depicted also in Figure [lb] the complexity may be consid- 
erably reduced by using the scheme which is based upon the 
decomposition of Theorem[2] for that section as well; see |[Tol . 
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